Apache Spark について

翻訳と辞書

Words near each other

・ Apache revolver
・ Apache Rifles
・ Apache Rocks the Bottom!
・ Apache Roller
・ Apache Rose
・ Apache Samza
・ Apache Scouts
・ Apache ServiceMix
・ Apache Shale
・ Apache Shindig
・ Apache Shiro
・ Apache Skies
・ Apache Sling
・ Apache Software Foundation
・ Apache Solr
・ Apache Spark
・ Apache Stanbol
・ Apache Stone
・ Apache Strike
・ Apache Struts
・ Apache Struts 2
・ Apache Subversion
・ Apache Synapse
・ Apache Tapestry
・ Apache Taverna
・ Apache tears
・ Apache Territory
・ Apache Thrift
・ Apache Tomcat
・ Apache TomEE

Dictionary Lists

mini英和辞書

mini和英辞書

Wikipedia English

ウィキペディア

翻訳と辞書　辞書検索 [ 開発暫定版 ]

スポンサードリンク

Apache Spark ：ウィキペディア英語版

Apache Spark

Apache Spark is an open source cluster computing framework originally developed in the AMPLab at University of California, Berkeley but was later donated to the Apache Software Foundation where it remains today. In contrast to Hadoop's two-stage disk-based MapReduce paradigm, Spark's multi-stage in-memory primitives provides performance up to 100 times faster for certain applications. By allowing user programs to load data into a cluster's memory and query it repeatedly, Spark is well-suited to machine learning algorithms.
Spark requires a cluster manager and a distributed storage system. For cluster management, Spark supports standalone (native Spark cluster), Hadoop YARN, or Apache Mesos. For distributed storage, Spark can interface with a wide variety, including Hadoop Distributed File System (HDFS),〔(Figure showing Spark in relation to other open-source Software projects including Hadoop )〕 Cassandra, OpenStack Swift, Amazon S3, (Kudu ), or a custom solution can be implemented. Spark also supports a pseudo-distributed local mode, usually used only for development or testing purposes, where distributed storage is not required and the local file system can be used instead; in such a scenario, Spark is run on a single machine with one executor per CPU core.
Spark had in excess of 465 contributors in 2014,〔(Open HUB Spark development activity )〕 making it not only the most active project in the Apache Software Foundation but one of the most active open source big data projects.
==History==
Spark was initially started by Matei Zaharia at UC Berkeley AMPLab in 2009, and open sourced in 2010 under a BSD license.
In 2013, the project was donated to the Apache Software Foundation and switched its license to Apache 2.0. In February 2014, Spark became a Top-Level Apache Project.
In November 2014, the engineering team at Databricks used Spark and set a new world record in large scale sorting.〔(Spark officially sets a new record in large-scale sorting )〕

抄文引用元・出典: フリー百科事典『ウィキペディア（Wikipedia）』
■ウィキペディアで「Apache Spark」の詳細全文を読む

スポンサードリンク

翻訳と辞書 : 翻訳のためのインターネットリソース

Copyright(C) kotoba.ne.jp 1997-2016. All Rights Reserved.